Neal-Montgomery NLP System Evaluation Methodology

نویسنده

  • Sharon M. Walter
چکیده

On what basis are the input processing capabilities of Natural Language software judged? That is, what are the capabilities to be described and measured, and what are the standards against which we measure them? Rome Laboratory is currently supporting an effort to develop a concise terminology for describing the linguistic processing capabilities of Natural Language Systems, and a uniform methodology for appropriately applying the terminology. This methodology is meant to produce quantitative, objective profiles of NL system capabilities without requiring system adaptation to a new test domain or text corpus. The effort proposes to develop a repeatable procedure that produces consistent results for independent evaluators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Better NLP System Evaluation

This paper considers key elements of evaluation methodology, indicating the many points involved and advocating an unpacking approach in specifying an evaluation remit and design. Recognising the importance of both environment variables and system parameters leads to a grid organisation for tests. The paper illustrates the application of these notions through two examples. 1. I n t r o d u c t ...

متن کامل

Test Suites for Quality Evaluation of NLP Products

Test suites are a useful evaluation tool for developers and users of NLP products. The paper gives an overview of the tsnlp design and methodology and describes how the tsnlp data and methodology can be used in practice to provide a reliable assessment method of the linguistic capabilities of NLP products.

متن کامل

Evaluation of the NLP Components of an Information Extraction System for German

This paper describes ongoing work on the evaluation of the NLP components of the core engine of smes (Saarbrücker Message Extraction System), which consists of a tokenizer, an efficient and robust German morphology, a part-of-speech (POS) tagger, a shallow parsing module, a linguistic knowledge base and an output construction component. Currently the morphology, the tagger and a parsing module ...

متن کامل

Language Systems, Inc.: MUC-4 test results and analysis

LSI's overall natural language processing (NLP) objective is the development of a broad coverage, reusable system which is readily transportable to additional domains, applications, and sublanguages in English, as well as providing a foundation for our multilingual work . Our system, called DBG, for Data Base Generator, is comprised of a set of NLP components which have been developed, extended...

متن کامل

A methodology and tool suite for evaluation of accuracy of interoperating statistical natural language processing engines

Evaluation of accuracy of natural language processing (NLP) engines plays an important role in their development and improvement. Such evaluation usually takes place at a per-engine level. For example, there are evaluation methods for engines such as speech recognition, machine translation, story boundary detection, etc. Many real-world applications require combinations of these functions. This...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1992